THE TRACY-WIDOM LIMIT FOR THE LARGEST EIGENVALUES
OF SINGULAR COMPLEX WISHART MATRICES 

By Alexei Onatski 

Columbia University 

This paper extends the work of El Karoui [Ann. Probab. 35 (2007)
663-714], which finds the Tracy-Widom limit for the largest eigenvalue
of a nonsingular p-dimensional complex Wishart matrix
$W_{\mathbb{C}}(\Sigma_p, n)$, to the case of several of the largest
eigenvalues of the possibly singular ($n < p$) matrix
$W_{\mathbb{C}}(\Sigma_p, n)$. As a byproduct, we extend all results of
Baik, Ben Arous and Peche [Ann. Probab. 33 (2005) 1643-1697] to the
singular Wishart matrix case. We apply our findings to obtain a 95%
confidence set for the number of common risk factors in excess stock
returns.

1. Introduction. The goal of this paper is to establish the joint
asymptotic distribution of a finite number of properly scaled and centered
largest eigenvalues of a p-dimensional complex Wishart matrix
$W_{\mathbb{C}}(\Sigma_p, n)$ as both n and p tend to infinity in such a
way that n/p remains in a compact subset of $(0,\infty)$. The paper extends
Baik, Ben Arous and Peche [4] and El Karoui [9], which find the asymptotic
distribution of the scaled and centered single largest eigenvalue of
$W_{\mathbb{C}}(\Sigma_p, n)$ under the assumption that n/p remains in a
compact subset of $[1,\infty)$. When n/p is less than 1, the Wishart matrix
is singular. The main contribution of this paper is extending [4] and [9]
to the singular Wishart case.
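
The singularity for n/p < 1 can be seen numerically. The following is a minimal sketch, assuming a diagonal population covariance and NumPy's random generator (the function name `complex_wishart_eigs` is illustrative, not from the paper): it draws n complex Gaussian vectors of dimension p > n, forms the p x p matrix $XX^*/n$, and counts its numerically nonzero eigenvalues.

```python
import numpy as np

def complex_wishart_eigs(sigma_diag, n, rng):
    """Eigenvalues (descending) of X X^* / n, where the n columns of X are
    i.i.d. complex Gaussian vectors with covariance diag(sigma_diag)."""
    sigma_diag = np.asarray(sigma_diag, dtype=float)
    p = sigma_diag.size
    # Standard complex Gaussian: real and imaginary parts each have variance 1/2.
    z = (rng.standard_normal((p, n)) + 1j * rng.standard_normal((p, n))) / np.sqrt(2.0)
    x = np.sqrt(sigma_diag)[:, None] * z
    w = x @ x.conj().T / n          # p x p Hermitian matrix of rank at most n
    return np.linalg.eigvalsh(w)[::-1]

rng = np.random.default_rng(0)
p, n = 30, 12                       # n < p: the singular case
eigs = complex_wishart_eigs(np.ones(p), n, rng)
rank = int(np.sum(eigs > 1e-10))    # number of numerically nonzero eigenvalues
print(rank, p - rank)
```

With n < p, only n eigenvalues are nonzero; the remaining p - n vanish up to rounding error, which is why the joint density has to be written for the nonzero eigenvalues only.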

One need for such an extension arises in a companion paper [17], which
develops statistical tests of various hypotheses concerning the number of
factors in Chamberlain and Rothschild's [8] approximate factor model, which
is widely used in empirical macroeconomics and finance. The model considers
a double-infinite sequence of random variables
$\{\xi_{it} : i, t \in \mathbb{N}\}$ such that, for any
$i, t \in \mathbb{N}$,

(1)  $\xi_{it} = \Lambda_i' F_t + \eta_{it}$,

where $F_t$ and $\Lambda_i$ are k-dimensional ($k < \infty$) vectors of
unobserved common factors and factor loadings, respectively, and
$\eta_{it}$ is an unobserved idiosyncratic component of $\xi_{it}$. In
contrast to the classical factor model (see [1], Chapter 14), the
idiosyncratic components are allowed to be correlated across the index i.
The identification of the idiosyncratic components is achieved by assuming
that the largest eigenvalue of the covariance matrix of vector
$\{\eta_{it}\}_{1 \le i \le p}$ stays bounded as p tends to infinity,
whereas the smallest nonzero eigenvalue of the covariance matrix of vector
$\{\Lambda_i' F_t\}_{1 \le i \le p}$ diverges to infinity as fast as p.
These assumptions are often interpreted as formalizing the requirements
that the common factors nontrivially influence all data points, whereas the
idiosyncratic components have only local effects.

Researchers in macroeconomics and finance use Chamberlain and
Rothschild's [8] model to handle high-dimensional data sets. They interpret
$F_t$ as a vector of factors nontrivially influencing hundreds of available
macroeconomic indicators (see, e.g., [25]) or, in the case of finance, as a
vector of the risk factors common to hundreds of stock returns (see, e.g.,
[7]). An important practical question is how many such factors there are.
Under the assumptions of the Chamberlain-Rothschild model, one can
equivalently ask how many eigenvalues of $XX'/n$, where X is the data
matrix $\{\xi_{it}\}_{1 \le i \le p,\, 1 \le t \le n}$, diverge to infinity
as both p and n tend to infinity in such a way that n/p remains in a
compact subset of $(0,\infty)$.

Using eigenvalue perturbation theory, [17] shows that if the true number
of factors is $k_0$, then the asymptotic distribution of the scaled and
centered $(k_0+1)$th, $(k_0+2)$th, and so on, eigenvalues of $XX'/n$ is the
same as that of the scaled and centered 1st, 2nd, and so on, eigenvalues of
the sample covariance matrix of the idiosyncratic components (slightly
abusing a standard definition, we define the sample covariance matrix of
vectors $v_1,\ldots,v_n$ as $\sum_{i=1}^{n} v_i v_i'/n$). Therefore,
assuming that $\{\eta_{it}\}_{1 \le i \le p}$ are i.i.d. (over
$t \in \mathbb{N}$) complex Gaussian $N_{\mathbb{C}}(0,\Sigma_p)$ vectors
(to make such an assumption realistic, we perform a preliminary
transformation of real-valued data into a complex-valued form), a test
(described in more detail in Section 4 below) of the null hypothesis that
the true number of factors equals $k_0$, against an alternative of more
than $k_0$ factors, can be based on checking whether the $(k_0+1)$th,
$(k_0+2)$th, and so on, eigenvalues of the sample covariance matrix of the
data are drawn from the joint distribution of the largest eigenvalues of
$W_{\mathbb{C}}(\Sigma_p/n, n)$. Since, in macroeconomics and finance, the
cross-sectional dimension p of data is often larger than their time-series
dimension n, to obtain the asymptotic critical values of the test, we must
analyze the joint asymptotic distribution of the largest eigenvalues of a
singular complex Wishart matrix.

El Karoui [9] proves that the asymptotic distribution of the properly
scaled and centered largest eigenvalue of a nonsingular complex Wishart
matrix, $W_{\mathbb{C}}(\Sigma_p/n, n)$, is the Tracy-Widom distribution of
type two (TW2). TW2 refers to a distribution with the cumulative
distribution function

  $F(x) = \exp\left\{-\int_x^\infty (s - x)\, q^2(s)\, ds\right\}$,

where $q(s)$ is the solution of the ordinary differential equation
$q''(s) = s\,q(s) + 2q^3(s)$ which is asymptotically equivalent to the Airy
function $\mathrm{Ai}(s)$ (see [15]) as $s \to \infty$. It plays an
important role in large random matrix theory (see [14]) because it is the
asymptotic distribution of the scaled and centered largest eigenvalue of a
matrix from the so-called Gaussian Unitary Ensemble (GUE) as the size of
the matrix tends to infinity.

The GUE is the collection of all $N \times N$ Hermitian matrices with
i.i.d. complex Gaussian $N_{\mathbb{C}}(0, 1/N)$ lower triangular entries
and (independent of them) i.i.d. real Gaussian $N(0, 1/N)$ diagonal
entries. Let $d_1 \ge \cdots \ge d_N$ be eigenvalues of a matrix from the
GUE. Define $\tilde d_i = N^{2/3}(d_i - 2)$. Tracy and Widom [26] studied
the asymptotic distribution of a few of the largest eigenvalues of matrices
from the GUE when $N \to \infty$. They described the asymptotic marginal
distributions of $\tilde d_i$, $i = 1,\ldots,m$, where m is any fixed
positive integer, in terms of a solution of a completely integrable system
of partial differential equations. If we are interested in the asymptotic
distribution of the largest eigenvalue only, then the system simplifies to
the single ordinary differential equation given above.
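
The GUE scaling described above can be explored by simulation. The sketch below is only an illustration under stated assumptions: the helper names `gue_matrix` and `scaled_top_eigenvalues` are hypothetical, the matrix size and the number of draws are arbitrary, and the finite-N draws only approximate the limiting joint law.

```python
import numpy as np

def gue_matrix(N, rng):
    """N x N Hermitian matrix with complex Gaussian entries of variance 1/N
    below the diagonal and real N(0, 1/N) entries on the diagonal."""
    # E|z|^2 = 1/N: real and imaginary parts each have variance 1/(2N).
    z = (rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))) / np.sqrt(2.0 * N)
    lower = np.tril(z, k=-1)
    diag = np.diag(rng.standard_normal(N) / np.sqrt(N))
    return lower + lower.conj().T + diag

def scaled_top_eigenvalues(N, m, rng):
    """N^(2/3) * (d_i - 2) for the m largest eigenvalues d_1 >= ... >= d_m."""
    d = np.linalg.eigvalsh(gue_matrix(N, rng))[::-1]   # descending order
    return N ** (2.0 / 3.0) * (d[:m] - 2.0)

rng = np.random.default_rng(1)
draws = np.array([scaled_top_eigenvalues(100, 2, rng) for _ in range(100)])
print(draws.mean(axis=0))   # Monte Carlo means of the two scaled eigenvalues
```

Each row of `draws` is one approximate draw from the two-dimensional joint distribution of the scaled top eigenvalues; the sample mean of the first column should lie near the known TW2 mean of about -1.77, up to finite-N and Monte Carlo error.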

In this paper, we extend El Karoui's [9] results to show that the
asymptotic distribution of the scaled and centered m largest eigenvalues
($m < \infty$) of a possibly singular complex Wishart matrix
$W_{\mathbb{C}}(\Sigma_p/n, n)$ is the same as the joint asymptotic
distribution of $\tilde d_1,\ldots,\tilde d_m$. We follow [24] in calling
such a joint distribution the joint Tracy-Widom distribution.

Random matrix theory has developed the following powerful method of
analysis of the joint asymptotic distribution of a few of the largest
eigenvalues of various random matrices as the dimensionality of the
matrices tends to infinity. First, the joint distribution of a few of the
largest eigenvalues is expressed in terms of the probabilities
$P(i_1,\ldots,i_m; J_1,\ldots,J_m)$ that disjoint subsets $J_1,\ldots,J_m$
of the real line contain exactly $i_1,\ldots,i_m$ eigenvalues. The latter
probabilities are then represented in the form of Fredholm determinants of
operators indexed by the dimensionality of the random matrix under
consideration. Finally, it is proved that the operators converge in the
trace-class norm as the dimensionality tends to infinity and the
corresponding limits are found. The outcome of such an analysis is an
expression of the joint distribution of a few of the largest eigenvalues in
terms of Fredholm determinants of the limiting integral operators. Often,
kernels of these operators have a relatively simple form which facilitates
further detailed analysis of the joint distribution of the largest
eigenvalues.
Let $\lambda_j$ be the jth largest eigenvalue of a complex Wishart matrix
$W_{\mathbb{C}}(\Sigma_p/n, n)$. The first step of the above method is then
performed as follows. For any real $s_1 > \cdots > s_m > 0$,

(2)  $\Pr(\lambda_1 \le s_1, \ldots, \lambda_m \le s_m) = \sum_{I} P(i_1,\ldots,i_m; (s_1,\infty), (s_2,s_1], \ldots, (s_m,s_{m-1}])$,

where I consists of all sets of m nonnegative integers $i_1,\ldots,i_m$
such that $i_1 = 0$ and $i_{j+1} \le j - i_j - \cdots - i_1$ for
$j = 1,\ldots,m-1$. In the special case where only the largest eigenvalue
is analyzed, we have
$\Pr(\lambda_1 \le s_1) = P(0; (s_1,\infty)) = E\prod_{j=1}^{p}[1 - \chi_{(s_1,\infty)}(\lambda_j)]$,
where $\chi_J(\cdot)$ denotes the indicator function of the set J and the
expectation is with respect to the joint distribution of
$\lambda_1,\ldots,\lambda_p$. For this special case, Baik, Ben Arous and
Peche [4] perform the second step of the above method. Assuming that
$n \ge p$, they show that
$E\prod_{j=1}^{p}[1 - \chi_{(s_1,\infty)}(\lambda_j)]$ equals the Fredholm
determinant $\det(1 - K_{n,p})$, where $K_{n,p}$ is an operator acting on
$L^2((s_1,\infty))$ with a kernel that has a convenient integral
representation.
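
The index set I in the first step of the method is easy to enumerate. The sketch below (function names `index_set` and `event_holds` are illustrative, not from the paper) builds I for a given m and checks it against the event it is meant to encode: the kth largest eigenvalue is at most $s_k$ exactly when at most k - 1 eigenvalues fall in the first k intervals combined.

```python
from itertools import product

def index_set(m):
    """All tuples (i_1, ..., i_m) of nonnegative integers with i_1 = 0 and
    i_{j+1} <= j - (i_1 + ... + i_j) for j = 1, ..., m - 1."""
    tuples = [(0,)]
    for j in range(1, m):
        tuples = [t + (i,) for t in tuples for i in range(j - sum(t) + 1)]
    return tuples

def event_holds(counts):
    """True when lambda_k <= s_k for every k, given counts[k-1] eigenvalues
    in the k-th interval (s_1, inf), (s_2, s_1], ..., (s_m, s_{m-1}]."""
    above = 0
    for k, c in enumerate(counts, start=1):
        above += c              # number of eigenvalues larger than s_k
        if above > k - 1:
            return False
    return True

m = 3
brute_force = [c for c in product(range(m + 1), repeat=m) if event_holds(c)]
print(index_set(m))
print(sorted(index_set(m)) == sorted(brute_force))
```

The number of admissible tuples grows with m (1, 2, 5, 14 for m = 1, ..., 4), so the sum over I stays short for the small m of practical interest.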

To establish the asymptotic distribution of the largest eigenvalue of
$W_{\mathbb{C}}(\Sigma_p/n, n)$, El Karoui [9] starts from the result of
Baik, Ben Arous and Peche [4]. He then finds centering and scaling
sequences $\mu_{n,p}$ and $\sigma_{n,p}$ such that, as both n and p tend to
infinity, the recentered and rescaled version of the operator $K_{n,p}$,
denoted $S_{n,p}$, converges in the trace-class norm to an operator
$E \cdot \mathrm{Ai} \cdot E^{-1}$ acting on $L^2((s_1,\infty))$, where E
is an operator which entails multiplication by a certain function and
$\mathrm{Ai}$ is an integral operator with the Airy kernel
$\mathrm{Ai}(x,y) = \int_0^\infty \mathrm{Ai}(x+u)\,\mathrm{Ai}(y+u)\,du$,
where $\mathrm{Ai}(x)$ is the Airy function. Since the Fredholm determinant
is continuous with respect to the trace-class norm and since it is
invariant with respect to conjugation, El Karoui [9] concludes that the
distribution of the centered and scaled largest eigenvalue of
$W_{\mathbb{C}}(\Sigma_p/n, n)$ converges to the distribution defined by
$\det(1 - \mathrm{Ai})$, which is TW2 (see [26]).

A careful inspection of El Karoui's [9] proofs reveals that they only
involve the assumption that n/p remains in a compact subset of
$[1,\infty)$ to be able to use the determinantal representation of the
cumulative distribution function of the largest eigenvalue of
$W_{\mathbb{C}}(\Sigma_p/n, n)$ established by Baik, Ben Arous and Peche
[4]. Therefore, we first extend Baik, Ben Arous and Peche [4] to the case
of a singular complex Wishart matrix. Somewhat unexpectedly, we find that
not only the determinantal representation of the cumulative distribution
function of the largest eigenvalue of $W_{\mathbb{C}}(\Sigma_p/n, n)$, but
also all the rest of their results, hold for the singular Wishart case
without any extra qualifications. Our extension of [9] easily follows from
the extension of [4].

The rest of this paper is organized as follows. In Section 2, we prove our
generalization of [4]. Section 3 generalizes [9] to the case of several
eigenvalues of a possibly singular complex Wishart matrix. Section 4
contains an application to the determination of the number of common risk
factors in stock return data. Section 5 concludes. The Appendix contains
proofs of some of the tangential statements of this paper.

2. Extension of Baik, Ben Arous and Peche [4]. In this section, we extend
the analysis of Baik, Ben Arous and Peche [4] to the singular situation
when $n < p$ and to the case of several of the largest eigenvalues of
$W_{\mathbb{C}}(\Sigma_p/n, n)$. Note that for a general positive integer
m, we have

(3)  $P(i_1,\ldots,i_m; J_1,\ldots,J_m) = \frac{1}{i_1!\cdots i_m!}\,\frac{\partial^{i_1+\cdots+i_m}}{\partial z_1^{i_1}\cdots\partial z_m^{i_m}}\, E\prod_{j=1}^{p}\Big[1+\sum_{k=1}^{m}(z_k-1)\chi_{J_k}(\lambda_j)\Big]\Big|_{z_1=\cdots=z_m=0}$

(see, e.g., formula (4.1) in [27]). Below, we establish a determinantal
representation of
$E\prod_{j=1}^{p}[1+\sum_{k=1}^{m}(z_k-1)\chi_{J_k}(\lambda_j)]$ which does
not depend on whether $n < p$ or $n \ge p$.

First, for the case $n < p$, we will find a convenient expression for the
joint density of the nonzero eigenvalues $\lambda_1,\ldots,\lambda_n$ of a
singular complex Wishart matrix $W_{\mathbb{C}}(\Sigma_p/n, n)$. Let
$\pi_j$ be the inverse of the jth largest eigenvalue of $\Sigma_p$. Define
$\lambda = (\lambda_1,\ldots,\lambda_n)'$, $\pi = (\pi_1,\ldots,\pi_p)'$,
$\Lambda = \mathrm{diag}(\lambda_1,\ldots,\lambda_n)$ and
$\Pi = \mathrm{diag}(\pi_1,\ldots,\pi_p)$. As shown in [18], formula (25),
the joint density equals

(4)  $f(\lambda) = \mathrm{const}\cdot V(\lambda)^2 \prod_{i=1}^{n}\lambda_i^{p-n} \int_{Q_1 \in CV(p,n)} e^{-n\,\mathrm{tr}(\Pi Q_1 \Lambda Q_1^*)}\,(Q_1^*\,dQ_1)$,

where $V(\lambda) = \prod_{1 \le j < i \le n}(\lambda_i - \lambda_j)$,
$CV(p,n)$ denotes the complex Stiefel manifold of $p \times n$ matrices
with orthonormal columns and $(Q_1^*\,dQ_1)$ is the exterior differential
form representing the uniform measure on the complex Stiefel manifold.
Throughout the remainder of this paper, "const" denotes possibly different
constants that may depend on p, n and $\pi$, but not on $\lambda$.
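Note that

(5)  $\int_{Q_1 \in CV(p,n)} e^{-n\,\mathrm{tr}(\Pi Q_1 \Lambda Q_1^*)}\,(Q_1^*\,dQ_1) = \frac{1}{\mathrm{Vol}(U(p-n))}\int_{R \in U(p)} e^{-n\,\mathrm{tr}(\Pi R_1 \Lambda R_1^*)}\,(R^*\,dR)$,

where $R_1$ is a $p \times n$ matrix of the first n columns of the matrix
$R = [R_1, R_2]$, and $U(p)$ is the set of all $p \times p$ unitary
matrices. A proof of (5) can be found in the Appendix. Now, since the
unitary group is compact, we have

(6)  $\int_{R \in U(p)} e^{-n\,\mathrm{tr}(\Pi R_1 \Lambda R_1^*)}\,(R^*\,dR) = \lim_{\varepsilon \to 0}\int_{R \in U(p)} e^{-n\,\mathrm{tr}(\Pi R \Lambda_\varepsilon R^*)}\,(R^*\,dR)$,

where $\Lambda_\varepsilon := \mathrm{diag}(\lambda_1,\ldots,\lambda_n,\varepsilon,2\varepsilon,\ldots,\varepsilon(p-n))$.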

The integral on the right-hand side of (6) is called the
Harish-Chandra-Itzykson-Zuber integral (see [14], Appendix A5 and page
648). It can be simplified as follows. Define
$\lambda^\varepsilon = (\lambda_1,\ldots,\lambda_n,\varepsilon,2\varepsilon,\ldots,\varepsilon(p-n))'$
and let $\lambda^\varepsilon_k$ be the kth component of vector
$\lambda^\varepsilon$. We then have

(7)  $\int_{R \in U(p)} e^{-n\,\mathrm{tr}(\Pi R \Lambda_\varepsilon R^*)}\,(R^*\,dR) = \mathrm{const}\cdot\big(V(\pi)V(\lambda^\varepsilon)\big)^{-1}\det\big(e^{-n\pi_j\lambda^\varepsilon_k}\big)_{1 \le j,k \le p}$.

Here, we assume that all of the $\pi_j$ are different. If some $\pi_j$ are
equal, then the formula should be changed according to l'Hopital's rule.

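Let $\{i_1,\ldots,i_p\}$ be a set of indices equal to the set
$\{1,2,\ldots,p\}$ and such that $i_1 < \cdots < i_n$ and
$i_{n+1} < \cdots < i_p$. Denote the multi-index $(i_1,\ldots,i_n)$ as
$\alpha$ and the multi-index $(i_{n+1},\ldots,i_p)$ as $\bar\alpha$, and
let $x_\alpha$ denote $(x_{i_1},\ldots,x_{i_n})'$, $x_{\alpha(k)}$ denote
$x_{i_k}$, $x_{\bar\alpha}$ denote $(x_{i_{n+1}},\ldots,x_{i_p})'$ and
$x_{\bar\alpha(k)}$ denote $x_{i_{n+k}}$. Finally, let $|\alpha|$ denote
$i_1 + \cdots + i_n$. Then, by the Laplace expansion theorem,
$\det(e^{-n\pi_j\lambda^\varepsilon_k})_{1 \le j,k \le p}$ is equal in
absolute value to

(8)  $\sum_{\alpha}(-1)^{|\alpha|}\det\big(e^{-n\pi_{\alpha(j)}\lambda_k}\big)_{1 \le j,k \le n}\,\det\big(e^{-n\pi_{\bar\alpha(j)}k\varepsilon}\big)_{1 \le j,k \le p-n}$.

The second determinant in the above sum is a Vandermonde determinant.
Hence,

(9)  $\det\big(e^{-n\pi_{\bar\alpha(j)}k\varepsilon}\big)_{1 \le j,k \le p-n} = e^{-n\varepsilon\sum_{j=1}^{p-n}\pi_{\bar\alpha(j)}}\prod_{1 \le j < k \le p-n}\big(e^{-n\pi_{\bar\alpha(k)}\varepsilon} - e^{-n\pi_{\bar\alpha(j)}\varepsilon}\big)$.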

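Further, note that

(10)  $V(\lambda^\varepsilon) = \mathrm{const}\cdot\varepsilon^{(p-n)(p-n-1)/2}\,V(\lambda)\prod_{i=1}^{n}\prod_{k=n+1}^{p}(\lambda_i - \lambda^\varepsilon_k)$.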

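Combining (5)-(10) and taking the limit as $\varepsilon \to 0$, we obtain

(11)  $\int_{Q_1 \in CV(p,n)} e^{-n\,\mathrm{tr}(\Pi Q_1 \Lambda Q_1^*)}\,(Q_1^*\,dQ_1) = \mathrm{const}\cdot\big(V(\pi)V(\lambda)\big)^{-1}\prod_{i=1}^{n}\lambda_i^{n-p}\sum_{\alpha}(-1)^{|\alpha|}V(\pi_{\bar\alpha})\det\big(e^{-n\pi_{\alpha(j)}\lambda_k}\big)_{1 \le j,k \le n}$.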

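Substituting (11) into (4), we find that the joint density of the nonzero
eigenvalues $\lambda = (\lambda_1,\ldots,\lambda_n)$ of the singular
complex Wishart matrix $W_{\mathbb{C}}(\Sigma_p/n, n)$ equals

(12)  $f(\lambda) = \mathrm{const}\cdot V(\lambda)\sum_{\alpha}(-1)^{|\alpha|}V(\pi_{\bar\alpha})\det\big(e^{-n\pi_{\alpha(j)}\lambda_k}\big)_{1 \le j,k \le n}$.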
We are now ready to generalize Proposition 2.1 of [4], which establishes
a determinantal representation of
$E\prod_{j=1}^{p}[1 - \chi_{(s_1,\infty)}(\lambda_j)]$, to the case of
$E\prod_{j=1}^{p}[1+\sum_{k=1}^{m}(z_k-1)\chi_{J_k}(\lambda_j)]$ and
general n and p.

Proposition 1. For any fixed q satisfying
$0 < q < \min\{\pi_j\}_{j=1}^{p}$, let $K_{n,p}$ be the operator acting on
$L^2((0,\infty))$ with kernel

  $K_{n,p}(\eta,\zeta) = \frac{n}{(2\pi i)^2}\int_{\Gamma}dz\int_{\Sigma}dw\; e^{-\eta n(z-q)+\zeta n(w-q)}\,\frac{1}{w-z}\left(\frac{z}{w}\right)^{n}\prod_{k=1}^{p}\frac{\pi_k - w}{\pi_k - z}$,

where $\Sigma$ is a simple closed contour enclosing 0 and lying in
$\{w : \mathrm{Re}(w) < q\}$ and $\Gamma$ is a simple closed contour
enclosing $\pi_1,\ldots,\pi_p$ and lying in $\{z : \mathrm{Re}(z) > q\}$,
both oriented counterclockwise. Then, for any real-valued measurable
bounded function $f(x)$ which equals 0 for any $x \le 0$,

  $E\prod_{j=1}^{p}(1 + f(\lambda_j)) = \det(1 + K_{n,p}f)$,

where 1 is the identity operator and f is the operator which entails
multiplication by the function $f(\cdot)$.

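Proof. Let us first focus on the case when $n < p$. The eigenvalues
$\lambda_{n+1},\ldots,\lambda_p$ then equal zero and we have
$E\prod_{j=1}^{p}(1+f(\lambda_j)) = E\prod_{j=1}^{n}(1+f(\lambda_j))$.
Using the equality $V(\lambda) = \det(\lambda_k^{j-1})_{1 \le j,k \le n}$
and (12), we obtain

  $E\prod_{j=1}^{n}(1+f(\lambda_j)) = \mathrm{const}\cdot\sum_{\alpha}(-1)^{|\alpha|}V(\pi_{\bar\alpha})\int\cdots\int\det(\lambda_k^{j-1})\det\big(e^{-n\pi_{\alpha(j)}\lambda_k}\big)\prod_{k=1}^{n}(1+f(\lambda_k))\,d\lambda_k$.

Using Andreief's [2] identity
$\int\cdots\int\det(f_j(x_k))\det(g_j(x_k))\prod_{k}d\mu(x_k) = \det\big(\int f_j(x)g_k(x)\,d\mu(x)\big)$,
we find that

(13)  $E\prod_{j=1}^{n}(1+f(\lambda_j)) = \mathrm{const}\cdot\sum_{\alpha}(-1)^{|\alpha|}V(\pi_{\bar\alpha})\det\left(\int_0^\infty(1+f(\lambda))\lambda^{j-1}e^{-n\pi_{\alpha(k)}\lambda}\,d\lambda\right)_{1 \le j,k \le n}$.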
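
Andreief's identity can be checked by hand on a toy example. The sketch below is an illustration only: it replaces the measure by a hypothetical discrete measure on four points, takes n = 2, and uses arbitrary test functions. When the integral runs over the whole product space rather than over an ordered region, the identity carries a factor n!, which in the proof is absorbed into "const"; the code keeps that factor explicit.

```python
from itertools import product
from math import exp, factorial

def det2(a):
    """Determinant of a 2 x 2 matrix given as nested lists."""
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

# Hypothetical discrete measure: atoms and their weights.
points = [0.3, 0.9, 1.7, 2.4]
weights = [0.5, 1.0, 0.25, 2.0]

n = 2
f = [lambda x: 1.0, lambda x: x]                         # f_j(x) = x^(j-1)
g = [lambda x: exp(-1.0 * x), lambda x: exp(-2.5 * x)]   # g_k(x) = exp(-c_k x)

# Left-hand side: the n-fold "integral" is a sum over all n-tuples of atoms.
lhs = 0.0
for idx in product(range(len(points)), repeat=n):
    xs = [points[i] for i in idx]
    w = 1.0
    for i in idx:
        w *= weights[i]
    F = [[f[j](xs[k]) for k in range(n)] for j in range(n)]
    G = [[g[j](xs[k]) for k in range(n)] for j in range(n)]
    lhs += det2(F) * det2(G) * w

# Right-hand side: n! times the determinant of single "integrals".
gram = [[sum(wt * f[j](x) * g[k](x) for x, wt in zip(points, weights))
         for k in range(n)] for j in range(n)]
rhs = factorial(n) * det2(gram)

print(abs(lhs - rhs) < 1e-12)
```

The same reduction from an n-fold integral of a product of determinants to a single determinant of one-dimensional integrals is what turns (12) into (13).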
Now, define $\nu = n - p$ (note that it is less than zero) and set

  $\phi_j(\lambda) = 0$ if $j \le -\nu$, and $\phi_j(\lambda) = \frac{n^{j+\nu}}{\Gamma(j+\nu)}\lambda^{j-1+\nu}e^{-nq\lambda}$ if $j > -\nu$, for $j = 1,\ldots,p$,

  $\Phi_k(\lambda) = e^{-n(\pi_k - q)\lambda}$, for $k = 1,\ldots,p$,

for any $0 < q < \min\{\pi_j\}_{j=1}^{p}$. Also, let

  $A = (A_{jk})_{1 \le j,k \le p}$,  $A_{jk} = \pi_k^{-(j+\nu)}$.

Note that $A_{jk} = \int_0^\infty\phi_j(\lambda)\Phi_k(\lambda)\,d\lambda$
for $j > -\nu$. Since A is a simple modification of a Vandermonde matrix,
we have

(14)  $\det A = \prod_{j=1}^{p}\pi_j^{-(1+\nu)}\prod_{1 \le j < k \le p}(\pi_k^{-1} - \pi_j^{-1})$.

Thus, A is invertible when all of the $\pi_i$ are distinct.

Next, define operators $B : L^2((0,\infty)) \to \ell^2(\{1,\ldots,p\})$ and
$C : \ell^2(\{1,\ldots,p\}) \to L^2((0,\infty))$ by

  $B(j,\lambda) = \phi_j(\lambda)$,  $C(\lambda,k) = \Phi_k(\lambda)$

and let f be the operator $L^2((0,\infty)) \to L^2((0,\infty))$ which
entails multiplication by a real-valued measurable bounded function
$f(x)$. Then, since

(15)  $\frac{n^{j}}{\Gamma(j)}\int_0^\infty f(\lambda)\lambda^{j-1}e^{-n\pi_{\alpha(k)}\lambda}\,d\lambda = (BfC)(j-\nu,\alpha(k))$

for $j = 1,\ldots,n$, we find, using (13), that

(16)  $E\prod_{j=1}^{n}(1+f(\lambda_j)) = \mathrm{const}\cdot\sum_{\alpha}(-1)^{|\alpha|}V(\pi_{\bar\alpha})\det\big(A^{(\alpha)} + B^{(n)}fC^{(\alpha)}\big)$,

where $A^{(\alpha)}$ is a submatrix of A that consists of the intersection
of its columns numbered $\alpha(1),\ldots,\alpha(n)$ with its last n rows,
that is, $A^{(\alpha)} = \{A_{j-\nu,\alpha(k)}\}_{1 \le j,k \le n}$.
Similarly, $B^{(n)}$ is an operator with the kernel that consists of the
last n elements of the kernel of B and $C^{(\alpha)}$ is an operator with
the kernel that consists of elements, numbered
$\alpha(1),\ldots,\alpha(n)$, of the kernel of C.

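Note that the right-hand side of (16) is proportional to the Laplace
expansion of $\det(A + BfC)$. To see this, use (15) and observe that the
kernel of $A + BfC$ has the following form:

  $\begin{pmatrix} \pi_1^{p-n-1} & \cdots & \pi_p^{p-n-1} \\ \vdots & & \vdots \\ 1 & \cdots & 1 \\ \pi_1^{-1} + \frac{n}{\Gamma(1)}\int_0^\infty f(\lambda)e^{-n\pi_1\lambda}\,d\lambda & \cdots & \pi_p^{-1} + \frac{n}{\Gamma(1)}\int_0^\infty f(\lambda)e^{-n\pi_p\lambda}\,d\lambda \\ \vdots & & \vdots \\ \pi_1^{-n} + \frac{n^{n}}{\Gamma(n)}\int_0^\infty f(\lambda)\lambda^{n-1}e^{-n\pi_1\lambda}\,d\lambda & \cdots & \pi_p^{-n} + \frac{n^{n}}{\Gamma(n)}\int_0^\infty f(\lambda)\lambda^{n-1}e^{-n\pi_p\lambda}\,d\lambda \end{pmatrix}$.

Hence,

  $E\prod_{j=1}^{n}(1+f(\lambda_j)) = \mathrm{const}\cdot\det(A + BfC) = \mathrm{const}\cdot\det(A)\det(1 + A^{-1}BfC)$.

So, interchanging the order of the composition of operators under the
determinant, we find that

  $E\prod_{j=1}^{n}(1+f(\lambda_j)) = \mathrm{const}\cdot\det(1 + CA^{-1}Bf)$.

By setting $f(\cdot)$ equal to minus the indicator function of
$(s,\infty)$ and letting $s \to \infty$ in both sides of the above
equality, we find that "const" in the above formula equals 1. Thus,

  $E\prod_{j=1}^{n}(1+f(\lambda_j)) = \det(1 + CA^{-1}Bf)$.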

From this point on, the proof of Baik, Ben Arous and Peche [4] progresses
practically without changes. We will provide it here in order to make this
paper self-contained. The kernel of the operator $CA^{-1}B$ in the above
determinant is

  $CA^{-1}B(\eta,\zeta) = \sum_{j=1}^{p}C(\eta,j)(A^{-1}B)(j,\zeta)$,  $\eta,\zeta > 0$.

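Further, from Cramer's rule,

(17)  $(A^{-1}B)(j,\zeta) = \frac{\det A^{(j)}(\zeta)}{\det A}$,

where $A^{(j)}(\zeta)$ is the matrix given by A with the jth column
replaced by the vector $(\phi_1(\zeta),\ldots,\phi_p(\zeta))'$. To compute
$\det A^{(j)}(\zeta)$, note that

  $\frac{1}{2\pi i}\int_{\Sigma}\frac{e^{w}}{w^{a}}\,dw = \frac{1}{\Gamma(a)}$ if a is a positive integer, and $= 0$ if a is zero or a negative integer,

where $\Sigma$ is any simple closed contour with counterclockwise
orientation enclosing the origin 0. By replacing $w \to \zeta n w$ and
setting $a = k + \nu$, this implies that

  $\frac{n}{2\pi i}\int_{\Sigma}\frac{e^{\zeta n w}}{(nw)^{k+\nu}}\,dw = \frac{\zeta^{k-1+\nu}}{\Gamma(k+\nu)}$ if $k > -\nu$, and $= 0$ if $k \le -\nu$,

and therefore that

  $\phi_k(\zeta) = \frac{n}{2\pi i}\int_{\Sigma}e^{\zeta n(w-q)}\frac{1}{w^{k+\nu}}\,dw$.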
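
The contour integral identity used here is simple to verify numerically. A minimal sketch, assuming the contour is the circle of a given radius around the origin and using the trapezoid rule in the angle (the function name `contour_integral` and the node count are illustrative choices):

```python
import cmath
from math import factorial, pi

def contour_integral(a, radius=1.0, nodes=256):
    """Approximate (1 / (2 pi i)) * integral of exp(w) / w**a over the circle
    |w| = radius, oriented counterclockwise, for an integer a."""
    total = 0.0 + 0.0j
    for m in range(nodes):
        w = radius * cmath.exp(2j * pi * m / nodes)
        # With w = r e^{i theta}, dw = i w d(theta), so the integrand becomes
        # exp(w) * w**(1 - a) / (2 pi) in the angle variable.
        total += cmath.exp(w) * w ** (1 - a)
    return total / nodes

for a in (3, 1, 0, -2):
    expected = 1.0 / factorial(a - 1) if a >= 1 else 0.0
    print(a, abs(contour_integral(a) - expected) < 1e-12)
```

For positive integers a the value is 1/(a-1)!, that is, 1/Gamma(a); for zero or negative integers the integrand is entire and the integral vanishes, which is what makes the rows of the kernel with index at most $-\nu$ drop out.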



Substituting the integral representation of $\phi_k(\zeta)$ into the jth
column of $A^{(j)}$ and pulling out the integrals over w, we obtain

  $\det A^{(j)}(\zeta) = \frac{1}{2\pi i}\int_{\Sigma}e^{\zeta n(w-q)}\det(A'(w))\,n\,dw$,

where the entries of $A'(w)$ are $A'_{ab}(w) = 1/p_b^{a+\nu}$, where
$p_b = \pi_b$ when $b \ne j$ and $p_b = w$ when $b = j$. Hence, by the
formula for a Vandermonde determinant,

  $\det A'(w) = w^{-(1+\nu)}\prod_{k \ne j}\pi_k^{-(1+\nu)}\prod_{1 \le a < b \le p}(p_b^{-1} - p_a^{-1})$

and so, using (14) and (17), we obtain

  $(A^{-1}B)(j,\zeta) = \frac{n}{2\pi i}\int_{\Sigma}e^{\zeta n(w-q)}\left(\frac{\pi_j}{w}\right)^{n}\prod_{k \ne j}\frac{w - \pi_k}{\pi_j - \pi_k}\,dw$.

However, for any simple closed contour $\Gamma$ that encloses
$\pi_1,\ldots,\pi_p$ but excludes w and which is oriented counterclockwise,

  $\frac{1}{2\pi i}\int_{\Gamma}e^{-\eta n(z-q)}z^{n}\frac{1}{w-z}\prod_{k=1}^{p}\frac{w - \pi_k}{z - \pi_k}\,dz = \sum_{j=1}^{p}e^{-\eta n(\pi_j - q)}\pi_j^{n}\prod_{k \ne j}\frac{w - \pi_k}{\pi_j - \pi_k}$.

Therefore, we find

  $CA^{-1}B(\eta,\zeta) = \frac{n}{(2\pi i)^2}\int_{\Gamma}dz\int_{\Sigma}dw\; e^{-\eta n(z-q)+\zeta n(w-q)}\,\frac{1}{w-z}\prod_{k=1}^{p}\frac{w - \pi_k}{z - \pi_k}\left(\frac{z}{w}\right)^{n}$,

which completes the proof when all of the $\pi_j$ are distinct. When some
$\pi_j$ are equal, the formula for the kernel $CA^{-1}B(\eta,\zeta)$
follows by taking proper limits and using l'Hopital's rule.

For the case when $n \ge p$ and $f(x)$ equals minus the indicator function
for the interval $(s,\infty)$, where $s \in \mathbb{R}$, the proposition is
equivalent to Proposition 2.1 of [4]. Extending Baik, Ben Arous and Peche's
[4] proof to the case of general $f(x)$ while keeping their assumption that
$n \ge p$ is straightforward. To save space, we omit such an extension from
the proof. □

In the next section, we will use Proposition 1 to extend the results of
El Karoui [9] to the case of several of the largest eigenvalues of a
complex singular Wishart matrix. Concluding this section, we would like to
note that our extension of Proposition 2.1 of [4] implies that the main
results of that paper, namely Theorems 1.1 and 1.2, hold under the
assumption that n/p (M/N in the notation of [4]) remains in a compact
subset of $(0,+\infty)$ as both n and p tend to infinity. This assumption
relaxes Baik, Ben Arous and Peche's [4] requirement that n/p remains in a
compact subset of $[1,+\infty)$. The Appendix contains a brief list of
changes that should be made to the proofs of Baik, Ben Arous and Peche [4]
(beyond the extension of Proposition 2.1) to justify such a relaxation.

3. Extension of El Karoui [9]. In this section, we will prove that the
joint distribution of the first m scaled and centered eigenvalues of a
complex Wishart matrix $W_{\mathbb{C}}(\Sigma_p/n, n)$ weakly converges to
the m-dimensional joint Tracy-Widom distribution. Such a convergence takes
place in both cases $n \ge p$ and $n < p$. The scaling and centering
sequences are the same for all of the m eigenvalues and have the form
proposed in [9].

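Proposition 2. Let $H_p$ be the spectral distribution of $\Sigma_p$. Let
$c_{n,p}$ be the unique solution in $[0,\pi_1)$ of the equation

  $\int\left(\frac{\lambda c_{n,p}}{1 - \lambda c_{n,p}}\right)^{2}dH_p(\lambda) = \frac{n}{p}$.

Assume that n/p remains in a compact subset of $(0,\infty)$,
$\limsup \pi_1^{-1} < \infty$, $\liminf \pi_p^{-1} > 0$ and
$\limsup c_{n,p}/\pi_1 < 1$. Define

  $\mu_{n,p} = \frac{1}{c_{n,p}}\left(1 + \frac{p}{n}\int\frac{\lambda c_{n,p}}{1 - \lambda c_{n,p}}\,dH_p(\lambda)\right)$

and

  $\sigma_{n,p} = \frac{1}{n^{2/3}c_{n,p}}\left(1 + \frac{p}{n}\int\left(\frac{\lambda c_{n,p}}{1 - \lambda c_{n,p}}\right)^{3}dH_p(\lambda)\right)^{1/3}$.

Then, as n and p tend to infinity, the joint distribution of the first m
centered and scaled eigenvalues
$\sigma_{n,p}^{-1}(\lambda_1 - \mu_{n,p}),\ldots,\sigma_{n,p}^{-1}(\lambda_m - \mu_{n,p})$
of matrix $W_{\mathbb{C}}(\Sigma_p/n, n)$ weakly converges to the
m-dimensional joint Tracy-Widom distribution.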



Proof. For a short proof of the uniqueness of $c_{n,p}$, see [9], formula
(11) and the discussion that follows the formula. Let us first prove that
$\liminf c_{n,p} > 0$ and $\limsup c_{n,p} < \infty$. Since, by assumption,
$\liminf n/p > 0$, there exists $\gamma > 0$ such that
$n/p \ge \gamma^2$. We have

  $\gamma^2 \le \frac{n}{p} = \int\left(\frac{\lambda c_{n,p}}{1 - \lambda c_{n,p}}\right)^{2}dH_p(\lambda) \le \left(\frac{c_{n,p}/\pi_1}{1 - c_{n,p}/\pi_1}\right)^{2}$

and, therefore, $c_{n,p} \ge \pi_1\gamma/(1+\gamma)$. This implies that
$\liminf c_{n,p} > 0$ because, by assumption,
$\limsup \pi_1^{-1} < \infty$. Further, since
$c_{n,p} < \pi_1 \le \pi_p$, the assumption that
$\liminf \pi_p^{-1} > 0$ implies that $\limsup c_{n,p} < \infty$. Note that
the facts just established, that $\liminf c_{n,p} > 0$ and
$\limsup c_{n,p} < \infty$, and the assumptions that
$\limsup c_{n,p}/\pi_1 < 1$ and that n/p remains in a compact subset of
$(0,\infty)$ imply that $\mu_{n,p}$ remains in a compact subset of
$(0,\infty)$ and that $\sigma_{n,p}$ decays to zero as fast as $n^{-2/3}$
when n tends to infinity.
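
The centering and scaling sequences can be computed numerically for a given population spectrum. The sketch below is hedged: it assumes that $c_{n,p}$, $\mu_{n,p}$ and $\sigma_{n,p}$ take the form used by El Karoui [9] (the assumed expressions are spelled out in the comments), that $H_p$ puts mass 1/p on each eigenvalue of $\Sigma_p$, and it uses plain bisection; the function name `centering_and_scaling` is illustrative.

```python
from math import sqrt

def centering_and_scaling(sigma_eigs, n):
    """Return (c, mu, sigma) for a population spectrum sigma_eigs.

    Assumed definitions, with r(lam) = lam * c / (1 - lam * c) and averages
    taken over the p eigenvalues of Sigma_p:
      c     solves  mean(r^2) = n / p  on [0, 1 / max(sigma_eigs)),
      mu    = (1 + (p / n) * mean(r)) / c,
      sigma = (1 + (p / n) * mean(r^3))^(1/3) / (n^(2/3) * c).
    """
    p = len(sigma_eigs)

    def moment(c, power):
        return sum((lam * c / (1.0 - lam * c)) ** power for lam in sigma_eigs) / p

    lo, hi = 0.0, 1.0 / max(sigma_eigs)   # pi_1 = 1 / (largest eigenvalue)
    for _ in range(200):                  # bisection: mean(r^2) increases in c
        mid = 0.5 * (lo + hi)
        if moment(mid, 2) < n / p:
            lo = mid
        else:
            hi = mid
    c = 0.5 * (lo + hi)
    mu = (1.0 + (p / n) * moment(c, 1)) / c
    sigma = (1.0 + (p / n) * moment(c, 3)) ** (1.0 / 3.0) / (n ** (2.0 / 3.0) * c)
    return c, mu, sigma

p, n = 40, 20                             # singular case, n < p
c, mu, sigma = centering_and_scaling([1.0] * p, n)
print(c, mu, sigma)
```

For an identity covariance matrix the output can be checked in closed form: the centering reduces to $(1+\sqrt{p/n})^2$, the upper edge of the Marchenko-Pastur law, and the scaling to $n^{-2/3}(1+\sqrt{p/n})(1+\sqrt{n/p})^{1/3}$, for n < p as well as for n >= p.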

Now, let $x_1 > \cdots > x_m$ be any real numbers. Since $\mu_{n,p}$
remains in a compact subset of $(0,\infty)$, whereas $\sigma_{n,p}$ tends
to zero as $n \to \infty$, there exists $N > 0$ such that for any
$n > N$, $s_i = \mu_{n,p} + \sigma_{n,p}x_i$ ($i = 1,\ldots,m$) are
positive numbers. In what follows, we will always take $n > N$. Consider
the function
$f_{z_1,\ldots,z_m}(x) = \sum_{k=1}^{m}(z_k - 1)\chi_{J_k}(x)$, where
$J_1,\ldots,J_m$ equal $(s_1,\infty), (s_2,s_1], \ldots, (s_m,s_{m-1}]$,
respectively, and $z_1,\ldots,z_m$ are any complex numbers. According to
(2) and (3),

(18)  $\Pr(\lambda_1 \le s_1,\ldots,\lambda_m \le s_m) = \sum_{\{i_1,\ldots,i_m\} \in I}\frac{1}{i_1!\cdots i_m!}\,\frac{\partial^{i_1+\cdots+i_m}}{\partial z_1^{i_1}\cdots\partial z_m^{i_m}}\,E\prod_{j=1}^{p}\big[1 + f_{z_1,\ldots,z_m}(\lambda_j)\big]\Big|_{z_1=\cdots=z_m=0}$.

By Proposition 1, we can equivalently say that
$\Pr(\lambda_1 \le s_1,\ldots,\lambda_m \le s_m)$ can be expressed as the
sum of a few coefficients in the power expansion of
$\det(1 + K_{n,p}f_{z_1,\ldots,z_m})$.

Consider a rescaled and recentered version of the kernel of the operator
$K_{n,p}$:

  $S_{n,p}(u,v) = \sigma_{n,p}K_{n,p}(\mu_{n,p} + \sigma_{n,p}u,\; \mu_{n,p} + \sigma_{n,p}v)$.

Let $S_{n,p}$ be an operator with kernel $S_{n,p}(u,v)$, which acts on
$L^2((x_m,\infty))$. Note that

  $\det(1 + K_{n,p}f_{z_1,\ldots,z_m}) = \det(1 + S_{n,p}g_{z_1,\ldots,z_m})$,

where $g_{z_1,\ldots,z_m}(x) = \sum_{k=1}^{m}(z_k - 1)\chi_{R_k}(x)$ and
$R_1,\ldots,R_m$ equal $(x_1,\infty), (x_2,x_1], \ldots, (x_m,x_{m-1}]$,
respectively.

Under the conditions of Proposition 2 and an additional condition that
$n \ge p$, El Karoui [9] proves that there exists $\varepsilon > 0$ such
that $S_{n,p}$ converges in trace-class norm to an operator
$E \cdot \mathrm{Ai} \cdot E^{-1}$ from the trace class, where E is the
operator which entails multiplication by $e^{-\varepsilon x}$ and
$\mathrm{Ai}$ is an integral operator acting on $L^2((x_m,\infty))$, which
has the Airy kernel
$\mathrm{Ai}(x,y) = \int_0^\infty\mathrm{Ai}(x+u)\,\mathrm{Ai}(y+u)\,du$.
The strategy of his proofs is the same as that of Baik, Ben Arous and
Peche [4]. First, Proposition 2.1 of [4] is used to represent $S_{n,p}$ in
the form $A_{n,p} \cdot B_{n,p}$, where $A_{n,p}$ and $B_{n,p}$ are
operators with kernels
$A_{n,p}(x,y) = A_{n,p}(x+y) = \int_{\Gamma}e^{n f_{n,p}(z,\,x+y)}\,dz$ and
$B_{n,p}(x,y) = B_{n,p}(x+y) = \int_{\Sigma}e^{n g_{n,p}(z,\,x+y)}\,dz$. By
the logic of steepest descent analysis, as n and p tend to infinity only
the behavior of $f_{n,p}$ and $g_{n,p}$ around their respective maxima
matters for the asymptotic behavior of $A_{n,p}(x)$ and $B_{n,p}(x)$. This
fact is then exploited to show the convergence of $A_{n,p}(x)$ and
$B_{n,p}(x)$ to $e^{-\varepsilon x}\mathrm{Ai}(x)$ and
$e^{\varepsilon x}\mathrm{Ai}(x)$, respectively, which implies the
convergence of $S_{n,p}$ to $E \cdot \mathrm{Ai} \cdot E^{-1}$.

A careful reading of El Karoui's [9] proofs reveals that he needs the
additional condition $n \ge p$ only to be able to use Proposition 2.1 of
[4]. For all other purposes, the inequality $n/p \ge 1$ in his proofs can
be replaced by $n/p \ge \gamma^2 > 0$ without changing the validity of the
proofs. Therefore, El Karoui's [9] result and our Proposition 1 imply the
convergence of $S_{n,p}$ to $E \cdot \mathrm{Ai} \cdot E^{-1}$ without the
extra condition that $n \ge p$.

Since trace-class operators form an ideal in the algebra of bounded linear
operators, the operators $S_{n,p} \cdot g_{z_1,\ldots,z_m}$ and
$E \cdot \mathrm{Ai} \cdot E^{-1} \cdot g_{z_1,\ldots,z_m}$ must be from
the trace class. Further, since $g_{z_1,\ldots,z_m}$ is a bounded function
for all $z_1,\ldots,z_m$,
$\|S_{n,p} \cdot g_{z_1,\ldots,z_m} - E \cdot \mathrm{Ai} \cdot E^{-1} \cdot g_{z_1,\ldots,z_m}\|_1$
is less than or equal to
$\|S_{n,p} - E \cdot \mathrm{Ai} \cdot E^{-1}\|_1\,\|g_{z_1,\ldots,z_m}\|$,
which converges to zero as n and p tend to infinity. Here, $\|K\|_1$
denotes the trace-class norm of the operator K and the above norm
inequality follows from the inequalities of Theorem 1.6 in [22]. Hence,
$S_{n,p} \cdot g_{z_1,\ldots,z_m}$ converges to
$E \cdot \mathrm{Ai} \cdot E^{-1} \cdot g_{z_1,\ldots,z_m}$ in the
trace-class norm.

Now, since the Fredholm determinant is continuous with respect to the
trace-class norm, $\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m})$ converges to
$\det(1 + E \cdot \mathrm{Ai} \cdot E^{-1} \cdot g_{z_1,\ldots,z_m})$ for
any $z_1,\ldots,z_m$. Further, since $E^{-1}$ and $g_{z_1,\ldots,z_m}$
commute,
$\det(1 + E \cdot \mathrm{Ai} \cdot E^{-1} \cdot g_{z_1,\ldots,z_m}) = \det(1 + E \cdot \mathrm{Ai} \cdot g_{z_1,\ldots,z_m} \cdot E^{-1})$.
However, determinants are invariant with respect to conjugation which
leaves an operator in the trace class (see Remark 2.1 in [4]). Therefore,
we have

  $\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m}) \to \det(1 + \mathrm{Ai} \cdot g_{z_1,\ldots,z_m})$

for any $z_1,\ldots,z_m$.



Since $\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m})$ exactly equals
$E\big(\prod_{j=1}^{p}(1 + \sum_{k=1}^{m}(z_k - 1)\chi_{J_k}(\lambda_j))\big)$,
it is a finite order polynomial in $z_1,\ldots,z_m$ and hence an analytic
function of $z_1,\ldots,z_m$. Further, as follows, for example, from
formulas (1.30) and (1.32) in [23],

(19)  $|\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m})| \le \mathrm{const}\cdot\exp\Big(\max_{j=1,\ldots,m}|z_j - 1|\,\|S_{n,p}\|_1\Big)$

(see also Lemma 3.3 of [22] for the case when m = 1). Since $S_{n,p}$
converges in the trace-class norm to $E \cdot \mathrm{Ai} \cdot E^{-1}$,
there exists a constant M such that $\|S_{n,p}\|_1 < M$ for all n and p.
This fact, together with (19), implies that the functions
$\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m})$ form a normal family of
analytic functions (see [21], page 5). Hence, the convergence of
$\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m})$ to
$\det(1 + \mathrm{Ai} \cdot g_{z_1,\ldots,z_m})$ is uniform on any compact
set in $\mathbb{C}^m$ and therefore all derivatives of
$\det(1 + S_{n,p} \cdot g_{z_1,\ldots,z_m})$, and thus all derivatives of
$E\prod_{j=1}^{p}[1 + f_{z_1,\ldots,z_m}(\lambda_j)]$, converge to the
corresponding derivatives of
$\det(1 + \mathrm{Ai} \cdot g_{z_1,\ldots,z_m})$ at
$z_1 = \cdots = z_m = 0$.

As shown by Johansson [12] [see his formulas (1.19), (3.46) and (3.48)],
$F(x_1,\ldots,x_m)$, defined as

  $F(x_1,\ldots,x_m) = \sum_{\{i_1,\ldots,i_m\} \in I}\frac{1}{i_1!\cdots i_m!}\,\frac{\partial^{i_1+\cdots+i_m}}{\partial z_1^{i_1}\cdots\partial z_m^{i_m}}\det(1 + \mathrm{Ai} \cdot g_{z_1,\ldots,z_m})\Big|_{z_1=\cdots=z_m=0}$,

is the distribution function for the m-dimensional joint Tracy-Widom
distribution. Using (18) and the convergence of the derivatives of
$E\prod_{j=1}^{p}[1 + f_{z_1,\ldots,z_m}(\lambda_j)]$ to those of
$\det(1 + \mathrm{Ai} \cdot g_{z_1,\ldots,z_m})$ just established, we
conclude that

  $\Pr\left(\frac{\lambda_1 - \mu_{n,p}}{\sigma_{n,p}} \le x_1,\ldots,\frac{\lambda_m - \mu_{n,p}}{\sigma_{n,p}} \le x_m\right) \to F(x_1,\ldots,x_m)$.

Since $F(x_1,\ldots,x_m)$ is a continuous function, such a convergence
implies that the joint distribution of
$\sigma_{n,p}^{-1}(\lambda_1 - \mu_{n,p}),\ldots,\sigma_{n,p}^{-1}(\lambda_m - \mu_{n,p})$
weakly converges to the m-dimensional joint Tracy-Widom distribution. □

Concluding this section, we note that since [9] used the assumption that
$n/p \ge 1$ only to be able to use Proposition 2.1 of [4], our finding that
this proposition holds for $n < p$ proves that all of El Karoui's [9]
results are valid for n/p remaining in a compact subset of $(0,\infty)$.

4. Application. In this section, we will see how Proposition 2 can be
used in the analysis of excess stock return data generated by the
approximate factor model (1). An approximate factor model for asset returns
forms the core of Chamberlain and Rothschild's [8] extension of the
arbitrage pricing theory (APT) of Ross [20]. The APT, one of the most
important finance theories, shows that asset prices must be well explained
by covariances of asset returns with a few common risk factors. An
important practical question is how many such common factors exist.

This question has attracted considerable research attention. Roll and Ross 
([19], page 1092) find that "at least three factors are important for pricing, 
but that it is unlikely that more than four are present." Brown and Weinstein 
([5], page 713) "find evidence that there may be as few as 3 economywide 
factors, and certainly no more than 5 if the APM (arbitrage pricing model) is 
correct." Trzcinka [28] finds that there may be between one and five common 
risk factors. Connor and Korajczyk [7] report one or two factors in non- 
January months, but three to six factors for January returns. Huang and Jo 
([10], page 988) find that "the evidence supports only a small number of
factors, generally one and at most two." Bai and Ng [3] estimate the number 
of common factors in stock returns to be two. Two is also the preferred 
number of factors in [16]. Makarov and Papanikolaou [13] find evidence that 
there are four factors in stock returns. 

In general, researchers find a small number of factors in the approximate 
factor model for excess stock returns. Often, they are uncertain about their 
point estimates. This uncertainty concerning point estimates is also reflected 
in the fact that different researchers often provide conflicting estimates. Even 
though the uncertainty is well recognized, it has never been formally quan- 
tified. Below, we will try to quantify this uncertainty. More precisely, we 
will find an asymptotic 95% confidence set for the number of factors by in- 
verting a statistical test for the number of factors partially developed in a 
companion paper [17]. 

In the companion paper, I am interested in testing the null hypothesis
of $k_0$ factors versus the alternative that the number of factors $k$ is larger
than $k_0$ but smaller than $k_{\max} + 1$, where $k_{\max}$ is an a priori maximum
number of factors. I assume that the real-valued data
$\{\xi_{it}\}_{1 \le i \le p,\, 1 \le t \le N}$, where $N = 2n$, is generated by model (1),
where the vectors of idiosyncratic components $\{\eta_{it}\}_{1 \le i \le p}$ are i.i.d.
Gaussian $N(0, \Sigma_p/2)$ and independent of the factors $\{F_t\}_{1 \le t \le N}$. To test
the hypothesis, I propose first to construct a new complex-valued data set
$\{\hat\xi_{it}\}_{1 \le i \le p,\, 1 \le t \le n}$, where $\hat\xi_{it} = \xi_{it} + \sqrt{-1}\,\xi_{i,t+n}$, and
then to compute the test statistic
$\max_{k_0 < i \le k_{\max}} (\gamma_i - \gamma_{i+1})/(\gamma_{i+1} - \gamma_{i+2})$, where
$\gamma_i$ is the $i$th largest eigenvalue of the sample covariance matrix of the new
data set (this matrix is defined as $\hat X \hat X^*/n$, where
$\hat X = \{\hat\xi_{it}\}_{1 \le i \le p,\, 1 \le t \le n}$). Theorem 3 in [17] shows that if
the factors' explanatory power is strong enough so that $\gamma_1, \ldots, \gamma_k$
increase faster than $p^{2/3}$, then, under the null hypothesis, the asymptotic
distribution of the proposed test statistic as $n$ and $p$ increase in such a way
that $n/p$ remains in a compact subset of $(0, \infty)$ is the same as the asymptotic
distribution of
$\max_{0 < i \le k_{\max} - k_0} (\lambda_i - \lambda_{i+1})/(\lambda_{i+1} - \lambda_{i+2})$,
where $\lambda_i$ is the $i$th largest eigenvalue of a $W_{\mathbb{C}}(\Sigma_p/n, n)$ matrix.
Under the alternative, the test statistic explodes in probability as $n$ and $p$
increase.

Proposition 2 implies that the asymptotic distribution of
$\max_{0 < i \le k_{\max} - k_0} (\lambda_i - \lambda_{i+1})/(\lambda_{i+1} - \lambda_{i+2})$
equals the distribution of
$\max_{0 < i \le k_{\max} - k_0} (\mu_i - \mu_{i+1})/(\mu_{i+1} - \mu_{i+2})$,
where $\mu_1, \ldots, \mu_{k_{\max} - k_0 + 2}$ have the joint
$(k_{\max} - k_0 + 2)$-dimensional Tracy-Widom distribution. This result allows
us to tabulate the critical values of the asymptotic distribution by using
Monte Carlo simulations of large-dimensional matrices from the GUE to
approximate the joint Tracy-Widom distribution. We approximate the joint
10-dimensional Tracy-Widom distribution of type two by the distribution of
the 10 largest eigenvalues of a 1000 x 1000 matrix from the Gaussian Unitary
Ensemble. We obtain an approximation for the latter distribution by
simulating 30,000 independent matrices from the ensemble and numerically
computing their 10 largest eigenvalues.
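
The simulation just described can be sketched in Python with NumPy. This is a minimal illustration under stated assumptions, not the code used to produce the tabulated values: the function names, the default matrix size and the default number of replications are hypothetical and chosen so that the sketch runs quickly. Because the ratio statistic is unchanged when all eigenvalues are shifted or rescaled by common constants, the GUE eigenvalues are used without centering or normalization.

```python
import numpy as np

def sample_gue_top_eigs(size, n_top, rng):
    """Return the n_top largest eigenvalues (descending) of one GUE draw."""
    a = rng.standard_normal((size, size)) + 1j * rng.standard_normal((size, size))
    h = (a + a.conj().T) / 2.0          # Hermitian matrix; overall scale is irrelevant here
    eigs = np.linalg.eigvalsh(h)        # real eigenvalues in ascending order
    return eigs[::-1][:n_top]

def ratio_statistics(top_eigs, max_m):
    """For m = 1..max_m, return max over 1 <= i <= m of
    (mu_i - mu_{i+1}) / (mu_{i+1} - mu_{i+2})."""
    gaps = top_eigs[:-1] - top_eigs[1:]     # consecutive eigenvalue gaps (positive)
    ratios = gaps[:-1] / gaps[1:]           # ratios[i-1] is the ratio with index i
    return np.maximum.accumulate(ratios[:max_m])

def simulate_percentiles(size=200, reps=500, max_m=8, levels=(50, 90, 95), seed=0):
    """Monte Carlo percentiles of the statistic; rows are levels, columns are m."""
    rng = np.random.default_rng(seed)
    stats = np.empty((reps, max_m))
    for r in range(reps):
        # m ratios require m + 2 eigenvalues
        stats[r] = ratio_statistics(sample_gue_top_eigs(size, max_m + 2, rng), max_m)
    return np.percentile(stats, levels, axis=0)
```

Calling `simulate_percentiles(size=1000, reps=30000)` would correspond to the settings stated in the text, at a much higher computational cost; the small defaults give only rough percentiles.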

Table 1 contains the critical values of our test for $k_{\max} - k_0 = 1, 2, \ldots, 8$.
For example, in the table, the approximate 95% critical value of the test of
three factors, versus the alternative $3 < k \le 10$, is in the 5th row (counting
from the bottom up) and the 2nd column (counting from the right). It equals
8.29.

Table 1

Approximate percentiles of the test statistics for the tests of $k_0$ factors, versus the
alternative of more than $k_0$ but less than $k_{\max} + 1$ factors

                         $k_{\max} - k_0$
  %       1       2       3       4       5       6       7       8
 50    1.27    1.95    2.30    2.54    2.74    2.92    3.09    3.24
 60    1.53    2.24    2.59    2.88    3.10    3.31    3.49    3.65
 70    1.86    2.61    3.01    3.32    3.59    3.82    4.01    4.20
 80    2.37    3.19    3.65    4.02    4.32    4.59    4.83    5.05
 85    2.75    3.62    4.15    4.54    4.89    5.20    5.45    5.70
 90    3.33    4.31    4.91    5.40    5.77    6.13    6.42    6.66
 91    3.50    4.49    5.13    5.62    6.03    6.39    6.67    6.92
 92    3.69    4.72    5.37    5.91    6.31    6.68    6.95    7.25
 93    3.92    4.99    5.66    6.24    6.62    7.00    7.32    7.59
 94    4.20    5.31    6.03    6.57    7.00    7.41    7.74    8.04
 95    4.52    5.73    6.46    7.01    7.50    7.95    8.29    8.59
 96    5.02    6.26    6.97    7.63    8.16    8.61    9.06    9.36
 97    5.62    6.91    7.79    8.48    9.06    9.64   10.11   10.44
 98    6.55    8.15    9.06    9.93   10.47   11.27   11.75   12.13
 99    8.74   10.52   11.67   12.56   13.42   14.26   14.88   15.25

Our test procedure can be interpreted as formalizing the widely used
empirical method to determine the number of (classical) factors based on the
visual inspection of the scree plot introduced by Cattell [6]. The scree plot
is a line that connects the decreasing eigenvalues of the sample covariance 
matrix of the data plotted against their respective order numbers. In prac- 
tice, it often happens that the scree plot shows a sharp break where the 
true number of factors ends and "debris" corresponding to the idiosyncratic 
influences appears. Our test statistic effectively measures the curvature of 
the scree plot at a would-be break point under the alternative hypothesis. 
When the alternative hypothesis is true, the curvature asymptotically tends 
to infinity. In contrast, under the null hypothesis, the curvature has a
nondegenerate asymptotic distribution that does not depend on the model's
parameter $\Sigma_p$.
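
To make the curvature interpretation concrete, the following Python sketch computes the ratio statistic from a list of eigenvalues. The function name and the eigenvalue sequence are illustrative assumptions; the sequence is made up so that the scree plot breaks sharply after the third eigenvalue, and it is not taken from the stock return data.

```python
import numpy as np

def scree_statistic(eigs, k0, k_max):
    """max over k0 < i <= k_max of (g_i - g_{i+1}) / (g_{i+1} - g_{i+2}),
    where g_1 >= g_2 >= ... are the eigenvalues sorted in decreasing order."""
    g = np.sort(np.asarray(eigs, dtype=float))[::-1]
    if not 0 <= k0 < k_max:
        raise ValueError("need 0 <= k0 < k_max")
    if k_max + 2 > g.size:
        raise ValueError("need at least k_max + 2 eigenvalues")
    gaps = g[:-1] - g[1:]            # differences between consecutive eigenvalues
    ratios = gaps[:-1] / gaps[1:]    # ratios[i-1] is the ratio with index i
    return ratios[k0:k_max].max()

# Hypothetical eigenvalues: three large ones followed by slowly decaying "debris".
hypothetical = [10.0, 9.0, 8.5, 1.0, 0.9, 0.8, 0.7, 0.6]

print(scree_statistic(hypothetical, k0=0, k_max=5))  # dominated by the break at i = 3
print(scree_statistic(hypothetical, k0=3, k_max=5))  # no break remains beyond i = 3
```

With the break included in the range of indices, the statistic is large (about 75 for these numbers); once the range starts after the break, it is close to 1.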

To construct a 95% confidence interval for the number of common factors 
in excess stock returns, we use data provided by the Center for Research 
in Security Prices (CRSP) on monthly returns on p = 972 stocks traded on 
the NYSE, AMEX and NASDAQ during the period from January 1983 to 
December 2006. The data set includes those and only those companies for 
which CRSP provides monthly holding period return data for all months 
in the studied time interval. To obtain the excess returns on the stocks, 
we subtract the one-month risk-free rate provided by CRSP from the stock 
returns. 

Since previous empirical research suggests that the number of common 
risk factors may be different in January and non-January months, we omit 
January data, which leaves N = 264 time observations of real-valued data. 
To obtain a complex-valued data set, we divide the real-valued data into two
parts: the first containing all observations from February 1983 to December
1994 and the second containing all observations from February 1995 to
December 2006. We then add the data from the first subperiod to the product
of the imaginary unit and the data from the second subperiod. Hence, the
dimensionality of the complex-valued data set is p = 972, n = 132. Note that
p is larger than n.
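
The construction of the complex-valued data set and of the sample eigenvalues can be sketched as follows. The CRSP data are not reproduced here, so the example uses synthetic Gaussian numbers; the function names are hypothetical. Computing the eigenvalues from the smaller n x n matrix is a convenience chosen for this sketch when p is larger than n, and it yields the same nonzero eigenvalues as the p x p sample covariance matrix.

```python
import numpy as np

def complexify(real_data):
    """Combine the two halves of a p x N real panel (N = 2n) into a p x n
    complex panel: first half + sqrt(-1) * second half."""
    p, N = real_data.shape
    if N % 2:
        raise ValueError("the number of time observations N must be even")
    n = N // 2
    return real_data[:, :n] + 1j * real_data[:, n:]

def top_sample_eigenvalues(x, k):
    """k largest eigenvalues of X X^*/n, computed from the n x n matrix X^* X/n."""
    p, n = x.shape
    gram = x.conj().T @ x / n               # n x n Hermitian matrix
    eigs = np.linalg.eigvalsh(gram)[::-1]   # descending order
    return eigs[:k]

# Synthetic stand-in for the excess return panel (assumption: pure noise, no factors).
rng = np.random.default_rng(0)
panel = rng.standard_normal((30, 20))       # p = 30 series, N = 20 time observations
x = complexify(panel)                       # shape (30, 10)
print(top_sample_eigenvalues(x, 5))
```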

We maintain the assumption that the true number of factors is strictly 
less than seven. We take seven as an upper bound for the number of factors
because it is consistent with the conclusions of previous studies. If a
5%-asymptotic-size test of the null hypothesis of $j$ factors, versus the alternative
that the number of factors is larger than $j$ but no larger than seven, does
not reject the null hypothesis, we include $j < 7$ in the asymptotic 95%
confidence set for the number of factors.

Table 2 contains the first nine eigenvalues $\gamma_1, \ldots, \gamma_9$ of the sample
covariance matrix of the complex-valued data set; the quantities
$(\gamma_i - \gamma_{i+1})/(\gamma_{i+1} - \gamma_{i+2})$, $i = 1, \ldots, 7$; the test statistics
$\max_{k_0 < i \le 7} (\gamma_i - \gamma_{i+1})/(\gamma_{i+1} - \gamma_{i+2})$ for
the tests of the null hypotheses of $k_0 = 0$, $k_0 = 1$, ..., $k_0 = 6$ factors, versus
the alternatives that the number of factors is larger than $k_0$ but no larger
than 7; and the corresponding 95% critical values taken from Table 1.

Table 2

The largest eigenvalues of the sample covariance matrix of the complex-valued data and
the test statistics for the tests of hypotheses that the number of factors equals $k_0$, versus
the alternative of more than $k_0$ but no more than 7 factors

$i$                                                              1      2      3      4      5      6      7      8      9
$\gamma_i$                                                    3.99   1.16   0.76   0.66   0.50   0.48   0.40   0.37   0.34
$(\gamma_i - \gamma_{i+1})/(\gamma_{i+1} - \gamma_{i+2})$     7.14   3.77   0.65  12.73   0.15   2.98   1.18

$k_0$                                                                                 0      1      2      3      4      5      6
$\max_{k_0 < i \le 7} (\gamma_i - \gamma_{i+1})/(\gamma_{i+1} - \gamma_{i+2})$    12.73  12.73  12.73  12.73   2.98   2.98   1.18
Critical values                                                                    8.29   7.95   7.50   7.01   6.46   5.73   4.52


We reject the null hypotheses of 0, 1, 2 and 3 factors by the tests of asymp- 
totic size 5%, but cannot reject the null hypotheses of 4, 5 and 6 factors. 
Hence, my 95% confidence set for the true number of common factors in the 
excess stock returns is {4,5,6}. This set intersects the range of estimates 
proposed by Brown and Weinstein [5], Roll and Ross [19] and Trzcinka [28]. 
It includes the point estimate 4 found in [13]. It is disjoint from the set of
estimates reported by [7], [10] and [3]. The present author's previous estimate,
2, reported in [16], is not included in the set. 
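
The test inversion behind this confidence set can be retraced from the tabulated numbers. The Python sketch below takes the seven ratios reported in Table 2 and the 95% row of Table 1 as inputs; the function and variable names are hypothetical, and rejection is taken to mean that the statistic exceeds the critical value.

```python
# Ratios (gamma_i - gamma_{i+1}) / (gamma_{i+1} - gamma_{i+2}), i = 1..7, from Table 2.
ratios = [7.14, 3.77, 0.65, 12.73, 0.15, 2.98, 1.18]

# 95% critical values from Table 1, indexed by k_max - k0.
crit95_by_m = {1: 4.52, 2: 5.73, 3: 6.46, 4: 7.01, 5: 7.50, 6: 7.95, 7: 8.29}

k_max = 7

def confidence_set(ratios, crit_by_m, k_max):
    """Collect every k0 in 0..k_max-1 whose null hypothesis is not rejected."""
    kept = []
    for k0 in range(k_max):
        stat = max(ratios[k0:k_max])          # maximum over k0 < i <= k_max
        if stat <= crit_by_m[k_max - k0]:     # not rejected at the 5% level
            kept.append(k0)
    return kept

print(confidence_set(ratios, crit95_by_m, k_max))  # [4, 5, 6]
```

The ratio 12.73 at $i = 4$ drives the rejections of $k_0 = 0, 1, 2, 3$; for $k_0 \ge 4$ it drops out of the maximum and the statistic falls below the critical value.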

The 95% confidence set {4, 5, 6} should, perhaps, appeal to the proponents
of multifactor financial models. The good news is that 0 and 1 do not
enter the set. However, this result should be taken with a grain of salt. The
reason is that the rejection of the null hypotheses of 0, 1, 2 and 3 factors may
be due to a failure of some of the primitive assumptions of the tests. For
example, no one would truly believe that the idiosyncratic components of the
excess stock returns are Gaussian and independent over time. Soshnikov's 
[24] results on the universality of the Tracy-Widom limit for the largest 
eigenvalue of sample covariance matrices require that n/p approaches 1 as 
n tends to infinity and assume that the tails of the distribution of the data 
points are relatively thin. For the financial data, the tails of the distribution 
may be heavy and the above test for the number of factors will then be 
invalid. In fact, the approximate factor model will no longer be a plausi- 
ble description of the data because it assumes the existence of the second 
moments of the data. 

Another discouraging possibility is that the asymptotics on which the tests
rely may poorly approximate the finite-sample situation. Perhaps most
importantly, the result of [17] that the asymptotic distribution of
$\max_{k_0 < i \le k_{\max}} (\gamma_i - \gamma_{i+1})/(\gamma_{i+1} - \gamma_{i+2})$ is the same as
that of $\max_{0 < i \le k_{\max} - k_0} (\lambda_i - \lambda_{i+1})/(\lambda_{i+1} - \lambda_{i+2})$,
where $\lambda_i$ is the $i$th largest eigenvalue of a $W_{\mathbb{C}}(\Sigma_p/n, n)$
matrix, relies substantially on the fact that if the true number of factors is $k_0$,
then under the null hypothesis, $\gamma_{k_0}/\gamma_{k_0+1}$ increases faster than $p^{2/3}$.
A casual inspection of the second row of Table 2 reveals that none of the ratios
$\gamma_i/\gamma_{i+1}$ are large. Hence, to use the obtained 95% confidence set
comfortably, one should check whether the asymptotic requirements in [17] can be
relaxed. Such a check is left for future research.

5. Conclusion. In this paper, it has been shown that the joint distribution
of several of the largest eigenvalues, centered and normalized, of a
$p$-dimensional complex Wishart matrix $W_{\mathbb{C}}(\Omega_p, n)$ converges to the joint
Tracy-Widom distribution as $n$ and $p$ tend to infinity in such a way that $n/p$
remains in a compact subset of $(0, \infty)$. This result extends [4] and [9] in two
directions. First, several of the largest eigenvalues, as opposed to the single
largest eigenvalue, have been analyzed. Second, and most important, $n$ is
allowed to be smaller than $p$, a situation corresponding to $W_{\mathbb{C}}(\Omega_p, n)$ being
a singular matrix.

It has also been shown that all results of Baik, Ben Arous and Péché [4]
and El Karoui [9] remain true if their assumption that $n/p$ remains in a
compact subset of $[1, \infty)$ is replaced by the less restrictive assumption that
$n/p$ remains in a compact subset of $(0, \infty)$.

Finally, it has been demonstrated how the theoretical result of this paper 
can be used to find a 95% confidence set for the number of common factors 
in excess stock returns. The established confidence set is {4, 5, 6}. Such a set 
formally quantifies the uncertainty concerning the true number of factors in 
excess stock returns evident in previous studies of the number of factors. The 
set supports some of the previous studies, but not others. Possible drawbacks 
in the proposed methodology of obtaining the 95% confidence set have been 
pointed out, suggesting some directions for future research. 

APPENDIX 

Proof of (5). Consider the following analytic homeomorphism $g$ (see
[11] for a useful summary of concepts from differential geometry) of almost
all of $U(p)$ onto almost all of the product $CV(p,n) \times U(p-n)$. Let
$g(R) = \{Q_1, S\}$, where $Q_1 = R_1$, $S = H_{R_1}^* R_2$, $H_{R_1}$ is such that
$[R_1, H_{R_1}] \in U(p)$ and the elements of $H_{R_1}$ are analytic functions of $R_1$.
Columns of $H_{R_1}$ can, for example, be obtained by applying the Gram-Schmidt
orthogonalization procedure to the projections of the first $p-n$ vectors of a
fixed basis in $\mathbb{C}^p$ onto the $(p-n)$-dimensional subspace orthogonal to the
columns of $R_1$. Such a construction will work for all $R$ such that the first
$p-n$ vectors of the fixed basis and the columns of $R_1$ are linearly
independent. Hence, it will work for almost all of $U(p)$. The inverse of $g$ is
given by $g^{-1}(\{Q_1, S\}) = [Q_1, H_{Q_1} S]$.
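
The map $g$ and its inverse can be checked numerically. The sketch below is an illustration under assumptions rather than part of the proof: the function names are hypothetical, the fixed basis is taken to be the standard basis, and a QR factorization stands in for Gram-Schmidt (it orthonormalizes the same nested column spans, possibly with different unit-modulus factors on the columns, which does not affect the check).

```python
import numpy as np

def random_unitary(p, rng):
    """A p x p unitary matrix obtained from the QR factorization of a complex Gaussian matrix."""
    z = rng.standard_normal((p, p)) + 1j * rng.standard_normal((p, p))
    q, _ = np.linalg.qr(z)
    return q

def complement_basis(r1):
    """p x (p-n) matrix H with orthonormal columns such that [r1, H] is unitary."""
    p, n = r1.shape
    basis = np.eye(p, dtype=complex)[:, : p - n]       # first p-n standard basis vectors
    projected = basis - r1 @ (r1.conj().T @ basis)     # project onto the complement of col(r1)
    h, _ = np.linalg.qr(projected)                     # orthonormalize the projections
    return h

def g(r, n):
    """Map R = [R1, R2] to the pair (Q1, S) with Q1 = R1 and S = H^* R2."""
    r1, r2 = r[:, :n], r[:, n:]
    return r1, complement_basis(r1).conj().T @ r2

def g_inverse(q1, s):
    """Map (Q1, S) back to [Q1, H S]."""
    return np.hstack([q1, complement_basis(q1) @ s])

rng = np.random.default_rng(0)
R = random_unitary(6, rng)
Q1, S = g(R, n=2)
print(np.allclose(S.conj().T @ S, np.eye(4)))   # S is unitary
print(np.allclose(g_inverse(Q1, S), R))         # the inverse recovers R
```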

The homeomorphism $g$ maps the differential form
$(R^* dR) = (R_1^* dR_1)(R_2^* dR_1)(R_2^* dR_2)$ to the product of forms
$(Q_1^* dQ_1)$, $(S^* H_{Q_1}^* dQ_1)$ and $(S^* H_{Q_1}^* d(H_{Q_1} S))$. We have
$(Q_1^* dQ_1)(S^* H_{Q_1}^* dQ_1) = (Q_1^* dQ_1)|S|(H_{Q_1}^* dQ_1) = (Q^* dQ_1)$,
where $Q = [Q_1, H_{Q_1}]$. Further, the form $(S^* H_{Q_1}^* d(H_{Q_1} S))$ can be
represented as a sum of $(S^* H_{Q_1}^* dH_{Q_1} S)$ and $(S^* dS)$. But the product of
$(Q^* dQ_1)$ with $(S^* H_{Q_1}^* dH_{Q_1} S)$ is zero because $(Q^* dQ_1)$ is a form of
maximum degree on the Stiefel manifold. Therefore,

$$\int_{R \in U(p)} e^{-\mathrm{tr}(\Pi R_1 \Lambda R_1^*)} (R^* dR)
  = \int_{S \in U(p-n)} \int_{Q_1 \in CV(p,n)} e^{-\mathrm{tr}(\Pi Q_1 \Lambda Q_1^*)} (Q^* dQ_1)(S^* dS)$$
$$= \mathrm{Vol}\{U(p-n)\} \int_{Q_1 \in CV(p,n)} e^{-\mathrm{tr}(\Pi Q_1 \Lambda Q_1^*)} (Q^* dQ_1).$$

□



A list of changes to the proofs in Baik, Ben Arous and Péché [4]. Our
extension of Proposition 2.1 of [4] to the case where $n/p < 1$ ($M/N < 1$
in their notation) implies that the main results of that paper, contained
in Theorems 1.1 and 1.2, hold if the requirement that $M/N$ remains in a
compact subset of $[1, +\infty)$ as both $N$ and $M$ tend to infinity is relaxed to
the requirement that $M/N$ remains in a compact subset of $(0, +\infty)$. Here is
a list of the few additional changes needed in the proofs of Theorems 1.1 and 1.2.
We use the notation of [4].

1. Formula (134) should be complemented by the following statement: 
Clearly, T x (t) > for < 7 < 1. 

2. The statement after formula (139) should be changed to
$1 < \mu = (\gamma + 1)^2/\gamma^2 < \infty$.

3. Formula (141) should be changed to 7 < 7 < 70 for fixed 70 > 1 and 
0<7< 1. 

4. Formula (143) should be changed to < 5 < min{ , / 1+ T/ 1+ ^ } , 

C := (1+ ^ )4 (l + 7 2 7o 4 )- 

5. Formula (144) should use definition of Co from change 4 above. 

6. Formula (177) should be complemented by the following reasoning: 
Define 

Ti(i) = (7 + 1)V + ( 7 2 - l)t + 2 7 . 

Note that 

r i( forl/5< 7 <l, 



^2(7 + 1^ 

e[ °' Pcl lr 1 (2p c ), for0< 7 <l/5. 

But T x { j^L ) > 6/25 for 1/5 < 7 < 1 and Ti(2p c ) = Q~j 2 for < 7 < 1/5. 

Hence, T x (t) > for < 7 < 1. 

7. Formula (237) should be changed in the same way as formula (141). 
See change 3 above. 
8. Formula (239) should be changed to < 5 < min{^, 2 (i+ 70 ) > ibf}' @i '■=
8/ l , (l+7o) 3 ^ 

9. Formula (241) should be changed to \f 3) (s)\ = \^- j2(s 2 _ ir \ < 7^7^ + 

2 < 16 , 16(1+70 ) 3 _ 
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